SEQUENCE LISTING

(1) GENERAL INFORMATION: 

(i) APPLICANT: HUSE, WILLIAM D. 

(iii) NUMBER OF SEQUENCES: 75

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: PRETTY, SCHROEDER, BRUEGGEMANN & CLARK
(B) STREET: [illegible] STREET, SUITE 200

(D) STATE: CALIFORNIA 

(E) COUNTRY: UNITED STATES 

(F) ZIP: 90071 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: CAMPBELL, CATHRYN A. 

(B) REGISTRATION NUMBER: 31,815 

(C) REFERENCE/DOCKET NUMBER: P31 8882 

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 619-535-9001 

(B) TELEFAX: 619-535-8949 

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7445 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: circular 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
AATGCTACTA CTATTAGTAG AATTGATGCC ACCTTTTCAG CTCGCGCCCC AAATGAAAAT 60 
ATAGCTAAAC AGGTTATTGA CCATTTGCGA AATGTATCTA ATGGTCAAAC TAAATCTACT 120 
CGTTCGCAGA ATTGGGAATC AACTCTTACA TGGAATGAAA CTTCCAGACA CCGTACTTTA 180 
GTTGCATATT TAAAACATGT TGAGCTACAG CACCAGATTC AGCAATTAAG CTCTAAGCCA 240 
TCTGCAAAAA TCACCTCTTA TCAAAACCAG CAATTAAAGG TACTCTCTAA TCCTGACCTG 300 
TTGGAGTTTG CTTCCGGTCT GGTTCGCTTT GAAGCTCGAA TTAAAACCCG ATATTTCAAG 360 
TCTTTCCGGC TTCCTCTTAA TCTTTTTGAT GCAATCCGCT TTGCTTCTGA CTATAATAGT 420 






CAGGGTAAAG ACCTGATTTT TGATTTATGG TCATTCTCGT TTTCTCAACT GTTTAAAGCA 480
TTTGAGGGGG ATTCAATGAA TATTTATGAC GATTCCGCAG TATTGGACGC TATCCAGTCT 540
AAACATTTTA CTATTACCCC CTCTGGCAAA ACTTCTTTTG CAAAAGCCTC TCGCTATTTT 600
GGTTTTTATC GTCGTCTGGT AAACGAGGGT TATGATAGTG TTCCTCTTAC TATGCGTCGT 660
AATTCCTTTT GGCGTTATGT ATCTGCATTA GTTGAATGTG GTATTCCTAA ATCTCAACTG 720
ATGAATCTTT CTACCTGTAA TAATGTTGTT CCGTTAGTTC GTTTTATTAA CGTAGATTTT 780
TCTTCCCAAC GTCCTGACTG GTATAATGAG CCAGTTCTTA AAATCGCATA AGGTAATTCA 840
CAATGATTAA AGTTGAAATT AAACCATCTC AAGCCCAATT TACTACTCGT TCTGGTGTTT 900
CTCGTCAGGG CAAGCCTTAT TCACTGAATG AGCAGCTTTG TTACGTTGAT TTGGGTAATG 960
AATATCCGGT TCTTGTCAAG ATTACTCTTG ATGAAGGTCA GCCAGCCTAT GCGCCTGGTC 1020
TGTACACCGT TGATCTGTCC TGTTTCAAAG TTGGTCAGTT CGGTTCCCTT ATGATTGACC 1080
GTCTGCGCCT CGTTCCGGCT AAGTAACATG GAGCAGGTCG CGGATTTCGA CACAATTTAT 1140
CAGGCGATGA TACAAATCTC CGTTGTACTT TGTTTCGCGC TTGGTATAAT CGCTGGGGGT 1200
CAAAGATGAG TGTTTTAGTG TATTCTTTCG CCTCTTTCGT TTTAGGTTGG TGCCTTCGTA 1260
GTGGCATTAC GTATTTTACC CGTTTAATGG AAACTTCCTC ATGAAAAAGT CTTTAGTCCT 1320
CAAAGCCTCT GTAGCCGTTG CTACCCTCGT TCCGATGCTG TCTTTCGCTG CTGAGGGTGA 1380
CGATCCCGCA AAAGCGGCCT TTAACTCCCT GCAAGCCTCA GCGACCGAAT ATATCGGTTA 1440
TGCGTGGGCG ATGGTTGTTG TCATTGTCGG CGCAACTATC GGTATCAAGC TGTTTAAGAA 1500
ATTCACCTCG AAAGCAAGCT GATAAACCGA TACAATTAAA GGCTCCTTTT GGAGCCTTTT 1560
TTTTTGGAGA TTTTCAACCT GAAAAAATTA TTATTCGCAA TTCCTTTAGT TGTTCCTTTC 1620
TATTCTCACT CCGCTGAAAC TGTTGAAAGT TGTTTAGCAA AACCCCATAC AGAAAATTCA 1680
TTTACTAACG TCTGGAAAGA CGACAAAACT TTAGATCGTT ACGCTAACTA TGAGGGTTGT 1740
CTGTGGAATG CTACAGGCGT TGTAGTTTGT ACTGGTGACG AAACTCAGTG TTACGGTACA 1800
TGGGTTCCTA TTGGGCTTGC TATCCCTGAA AATGAGGGTG GTGGCTCTGA GGGTGGCGGT 1860
TCTGAGGGTG GCGGTTCTGA GGGTGGCGGT ACTAAACCTC CTGAGTACGG TGATACACCT 1920
ATTCCGGGCT ATACTTATAT CAACCCTCTC GACGGCACTT ATCCGCCTCG TACTGAGCAA 1980
AACCCCGCTA ATCCTAATCC TTCTCTTCAG CAGTCTCACC CTCTTAATAC TTTCATGTTT 2040
CAGAATAATA GGTTCCGAAA TAGGCAGGGG GCATTAACTG TTTATACGGG CACTGTTACT 2100
CAAGGCACTG ACCCCCTTAA AACTTATTAC CACTACACTC CTGTATCATC AAAAGCCATG 2160
TATGACGCTT ACTGGAACCG TAAATTCAGA GACTGCGCTT TCCATTCTGG CTTTAATGAA 2220
GATCCATTCG TTTGTGAATA TCAACCCCAA TCCTCTGACC TGCCTCAACC TCCTGTCAAT 2280
GCTGGCGGCG GCTCTGGTGG TGGTTCTGGT CGCGGCTCTG AGGGTGGTGG CTCTGAGGGT 2340
GGCGCTTCTG AGGGTGGCGG CTCTGAGGGA GGCGGTTCCG GTCGTGGCTC TGGTTCCGGT 2400
GATTTTGATT ATGAAAAGAT GGCAAACGCT AATAAGGGGG CTATGACCGA AAATGCCGAT 2460



GAAAACCCGC TACAGTCTGA CGCTAAAGGr inrrrr,^ 

GCTCr - iTrr GCTAAAGGC AAACTTGATT CTCTCGCTAC TGATTACGGT 

GCTGCiATCG ATGGTTTCAT TrrTrAr W ^ 

TTTCAT TCCTGACGTT ICCCGCCTTG CXAAICCTAA TGGTCCTACT 
CGTCATTTTG CTGGCTCTAA TTrrr-....,- '^GTGCTACT 
ITAAIGAATA a-JT ^GCCAAATG GCTCAAGTCG GTGAGGGTGA TAATTCACCT 

7 ATAITTACCT TCCCTCCCTC ^™ ATGTCGCCCT 

mGTGTTTA CCGCTGGTAA AGCATATGAA TTTTCTATTG ATTGTCACAA AATAAACTTA 
XTCCCXCCTG XCXXXCCCXT XCTXTXAXAX CTXCCCACCX TTATCTATGT a™ 
TTTGCTAACA TACXCCCXAA TAAGCAGTCT TAATCATCCC AGTTCTTTTC GG^ 
-XXAXXCCC XXXCCXCCCX XXCCXXCXCC XAACXXICTX CGGCTATCTG CXTACXXTXC 
TTAAAAAGGG CTXCCGXAAG ATAGCTATTG CTATTTCATT CITTCTTGCT CTTATTATTC 

cccxxaacxc aaxxcxxcxc cgxxaxgxcx cigaiattag coctcmita ccci ™™ 

™ ;:r attctcccot t ™ ™ 

TCTCTCTAAA GGCTGGTATI TTCATTTTTG ACGXTAAACA AAAAAXCGTI TGTTATTTGG 

attgggataa ataataiggc iGTTTArrrr cr^r^. tcxxatxicc 

".IITATXTT ctaactggca aattaggctc tggaaagacc 

CXCCTIAGCG TTCGTAAGAT TCAGGATAAA ATTGTArrr,- , TGGAAAGACC 

AITCTAGCTG GGTCCAAAAT AGCAACTAAT 

GTTGATTTAA CGCTTCAAAA CCTCrrrr». 

CCTCCCGCAA CTCCGCAGGT TCGCTAAAAC GCCTCGCCTT 

OXACAAXAC CCCAXAACCC TTCTATATCT CAXXXCCXXC CXATXGGGCG CCCXAAXGa" 
KCTACCATC AAAATAAAAA CCCCXXCCXX CXXCXCCAXC ACXCCCCXAC XXCCXXxl 
ACCCCTTCn CCAAXCAXAA CCAAACACAC CCCAXTAXXC ATTGCTTTCT ACAXCCxl 
= AX GCGATATTAT TTTTCTTCTT CACCACXXAX CXAXXGXXCA XAAACAGGCC 

zzi r cMcA tcttctttai mc T ~ r — 

™ ' ICmmCT XGCCXCXCCC XAAAXXACAX 

A™ T TATGC CGATrCTCM ""^ «™« "CCTXXAX 

rr™ cccaiaicai aciaaacacg c ™ c *™ 

XCCCCXCXXX AXXCXTAXXX AACCCCXXAX XXAXCACACC CXCCCXAXXX CAAACCAXXA 
AAXTXACCTC AGAAGAICAA GCITACIAAA AT.T.t™.. 

«-™ctaaa aiaiaxxtga AAAACTTXTC accccxtcit 
reicncccA xtggaittcc aicagcaixx acatat.™ .... 

Mer™... . ACAXAIACTT AIAXAACCCA ACCIAACCCC 

<ZZZ C TCA0ACCttT ™" AATXCACXAX TCACXCTXCX 

a™ TCCCTATCTT ™" AXTAAXXAAX 

ZZ I " AGGITATICA -CXCXTXCC 
TXAAAAAAC GXAAXXCAAA XCAAAXXCX AAAXCXAAXT AAXXXXCXTX XCXXCAXCXX 

ZZc x 1 ™ 0 CTCAGCTAAI ~ T ~ xcccccaxxx 

xT™ C «*™» «™ec«. AXCXAAAACC 

" GTAXAXXCAX CXCACCXXAA ACCXCAAAAX CXACCCAAXX XCXXTAXXXC 
XCXXTXACCX GCXAAXAAXX XXCAXAXCCX XCCXXCAAXX CCXXCCAXAA XXCACAACXA 



2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 



:z: zz7 atoa ato ~« « 

^Z" GTCCTTTCTT ICTTCCCCAA .TAGTGAAAG 

mTAAAATT AATAACCTTC GGGGAAAGGA TTTAATACCA GTTCTGGAAT TGTTTGTAAA 
„ ~ CMATCTAIT A ™ ™= *™ 

Z IAm TA0AIAACCT TCCTCMTO ^ ™™= 

AA^AGCAG ATA TO ATTG AGGCTTTGAT ArTTCAGCTT GACGAAGGTG ATGCTTTAGA 
TTTTTCAm GCTCCTCGCT CTGAGCGTf-r r.'/^-. ""-nTAGA 
CCTCAGCirr rm, CTCAGCGTCC CACTGTTGCA GGGGGTGTIA AIAGTGACCG 

CCTCACCICT GTTTTATCTT CTGCTCGTGG TTCGTTdCr-T .rm,. 
... ilui ™GCT ATTTTTAATC CCCATCTTTT 

Z2Z ZT C " TAAACAC ™ AAAATATTGT CTGTGCCACG 

TATTCTTAGG GTTTCAGGTG AGAAGCGTTG TAIGTCTGTT GGCCAGAATG TGGGTTTTAT 

" ZZ7 a M ~ u ~ T ™ ~ 

ICAAAATCTA GGTATTTCCA TGAGCGTTTT TCCTCTTrrA at,™ 

TCTfPATA tt ao TCCTGTTGCA ATGGCTGGCG GTAATATTGT 

TCTGGATATT ACCAGCAAGG C CG ATArTTT 

^CGATAGTTT GAGTTCTTCT ACTCAGGCAA GTGATGTTAT 

AGXAAXGAA ACAAGTA1TG GTAGAAGGCT TAATTTGCGT CATGGACAGA 

™i :™ M - aene t — — ~ 

™Z " ICTTTAGCTC ccocictcai icc — 

« T CCATACTACC AGGGGGGGGG 

GtGTGGTGGT TACCCGCAGC GTGACCCCTA GAGTTGGGAG CGCCCTAGCC GGGGGTGGTT 

™ r cmT ~ e " t ™ — » 
zzi T Tr mc cgaittagt ° ctttacgoca c ~ — <™ 

~ Lc A0TCGCCCAI CCC ™ "™ 

^ ^ r AATAGTGGAC TCTTCTTCCA AACTCGAACA ACACTCAACC 

™ CATTTATAAC CGAmT = CC GGACCATCAA 

~ C MCAAACCAG "-CAAG TCTCTCAGGG 

™ A r CAAIC A °™ GGTCTCGCTG GTGAAAAGAA AAACCACCCT 

ZT ca CCTCIcecco csottcccc »» 

AC^GAGGTT TCCCCACTCG AAACCGCCCA GTGAGCGCAA CCCAATTAAT GTCACTTACC 
™ A °<— TTATCCTTCC „ATC ^TCXGOAA 

~ ATAACAATTT CACACCCGTC AC^GCACT CCCCCTCCTT TTACAACCTC 

r CCIGGC ^ — - AAGTGAAACA 

" c tZ ~ C *™«** ACCGTTACTC TITACCCCTC TGACAAAAGC 

™ " r ctco aotcaoccct atotcccca e " Mn «* — 

ZZZn ! CTGCTAAGCC CAAGTCCTAC 

TCACTACATT GGGTAGGGTT CGGCTATCGT AGTACTTATA GTTCGTGCTA GGATAGGGAT 



4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 
TAAATTATTC AAAAACTTTA. CCACCA.GGC TTCTTAAGCA ATAGCCAACA CGGCCGCACC
CArCGCGCTT CCCAACACTT CCCCACCCTG AATGGCCAAT GCCCCTTTGC CTCCTTTCCC 
GGAGGAGMG CGCTGCCGGA AACCTCCCTG GAGXGGGATG TTGGTGAGGG GGATACGGTG 
GTCCTCCCCT OAAGTGGGA GATGCAGGGT TACCATCCGC GGATGTAGAG CAAGGTAAGG 
TATCCCATTA CGGTCAATCC CCCGTTTGTT GCGAGGGAGA ATCCCACCGG TTGTTACTGG 
GTGAGATTTA ATCTTGATGA AACCTGGCTA GAGGAAGGGG ACACCCGAAT TATTTTTGAT 
GGCGTTCCTA TTCGTTAAAA AATGAGCTGA TTTAACAAAA ATTTAACGCG AAITTTAACA 
AAATAXTAAG CTTTACAATT TAAATATTTC CTTATACAAT CTTCCTGTTT TTGGGGCTTT 
TCTGATTATC AAGCGGGGTA CATATGATTG AGATGGTAGT TTTACGATTA CCCTTCATCG 
ATTCTCTTGT TTGCTCCAGA CTCTCAGGCA ATCACCTCAT AGCCTTTGTA GATCTCTCAA 
AAATAGGTAG CCTCTCCGGC ATTAATTTAT GAGGXAGAAG GGTTGAATAT CATATTGATC 
CTGATTTCAC TGTGTGGGGG CTTTCTCAGC CTTTTGAATC TTTACCTACA CATTACTCAG 
GGATTGGArr TAAAATATAT GAGGGTTCTA AAAATTTTTA TCCTTGCGTT GAAATAAAGG 
CTTCTCCCGC AAAAGTATTA CAGCGTCATA ATGTmtGG TACAACCGAT TTAGCTTTAT 
CCTCTCAGCC TTTATTGCTT AATTrrrrrA , Tr , 

MTTrTGCTA ATTCTTTGCC TTGCCTGTAT GATTTATTGG 

ACGTT 

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: circular

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:
AAIGCTACTA CTATTACTAC AATTGATGGC ACCTTTTCAG CTCGCGGGGC AAATGAAAAT 
ATAGCTAAAC AGGTTATTGA CCATTTCCGA AATGTATCTA ATGCTCAAAC TAAATCTACT 
CGTTCGCAGA ATTCGGAATC AACTGTTACA TGGAATCAAA CTTCCAGACA CCCTACTTTA 
GTTGCATATT TAAAACATGT TCAGCTACAG CACCAGATTC AGCAATTAAG CTCTAAGCCA 
TCCGCAAAAA TCACCTCTTA TCAAAAGCAG CAATTAAAGC TACTCTCTAA TCCTGACCTG 
TTGGAGTTIC CTTCCCGTCT GCTTCCCTTT GAAGCTCGAA TTAAAACGCG ATATTTCAAG 
TCTTTCCCCC TTCCTCTTAA TCTTTTTGAT GCAATCCGCT TTCCITCTGA CTATAATAGT 
CAGGGTAAAG ACGIGATTTT TGATTTATGG TCATTCTCGT TTTCTGAAGT GTTTAAAGCA 
TTTCAGCGGG ATTCAATGAA TATTTATGAG GATTCGGGAG TATTGGACGC TATGGAGTGT 
AAACATTTTA CTATTACCCC CTCTGGCAAA ACTTCTTTTG CAAAAGCCTC TGGGTATTTT 
CGTTTTTATC GTCGTGTGGT AAACGACCGT TATGATAGTG TTCCTCTTAC TATGGCTCGT 

AArrccrm ggcgttatgt atgtggatta gttgaatgtg gtattcgtaa atctgaactg 



6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7445 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



"ZZ "7°™ ™™ ™m 

zz ~: ccacttctta ^ *™ 

CXCCXCACGC CAAGGCXXA^ X^T 

-AXGGGGX XCXXCXCMG ^ ™<™« ™*TG 

XCXACACCGX XCAX^ ™= 
GXCXGCGCGT r ^ " OTCACTT <»neccn AXCAXXCAeC 

XGXGGGGGX GGXXGGGGGX AAGXAACAXC CAGCAGGTCG GGGAXXXCGA CACAATTTAT 
CAGGCGATGA TACAAATCTr r^-r^, CACAATTTAT 

GAAAGAXGAG «ZS aTxT 

-GGAXXAG GXAXXTXACC CCXXXAAXCG ^ XGCCXXCGXA 
GAAAGCGXCT rr.~, AAACIXCCXC AIGAAAAACI CXIXAGXCCX 

ZZ " 1 CTACOTCCT ICCCATCCT0 Tcmcccrc 

zz zz cm cccacccmi — » 

AXTGAGGXCC ZT CCCAACXAXC «™GAAGG XGTXXAAGAA 

AfTCACCTCG AAAGCAAGCT GATAAA rrri T .r,.^. 

TTTTTGGArA ™ GATAAACCGA TACAATTAAA GGCTCCTTTT GGAGCCTTTT 

TTTTTGGAGA TTTTCAACGT GAAAAAATTA TTATTCrrAA tt. - 

TATTCTCACT CCGCTGAAAC TCTTCAAatt WIMCmC 
TTTACTAArr Tr TGTTCAAAGT TGTTTAGCAA AACCCCATAC AGAAAATTCA 

TTTACTAACG TCTGGAAAGA CGACAAAACT TTiriTrrrr 

CTGXGGAAXG CXAGAGGGGX X CT ™ ^*** < *^X^ 
XCGGTXCGTA rrrr ^GTAGXTICX AGXGGXGACG AAAGXCAGXC XTAGGGXACA 

ZZ c IATCOTCAA MKMeeK GGGXGGCCGX 

zz zr* coctc<:cgct acta " cctc ~- — 

ZZ ™: ZT cm — -~ 

CAGAAXAAXA GCXT ^""^ ^ ™»«m 

CAAGCCACTC IAG0CA ° CGG ™»«" 

ZZ " AAC ™ mC AAAAGCCATC 

zz zr TAAAncAeA »~ — 
zz zz zz ~~ — ~« 

OCGGCXXCXG ZT^ ° CC0CCTCTC AG GCXCCXGG CXGXGAGGGX 

ZZ ZZ ZZ "~ ~ «~ 

GAAAACCCCC XACAGXCXGA ZZ AAXAAGCGCG CXAICACCCA AAAICCCGAI 
GCXGCXAXCG AXGGxZ ™*» 
GGXCATTXIC GXGgTgT ZZ gT^ 

TTAAICAAXA ArrrT " CCCAAATC 0CT1 ^"CG GXGAGGGXGA XAAXICAGGX 

ZZ G^x AIAmACCT ICCCTCCCTC ™- 

TXXGXGXXXA GGGGXGGXM AGGAXAXGAA XTXXGXAXXG AXTGXGAGAA AAXAAAGXXA 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 




ZZZ 1™°™ TCTmAIAI ™ caccT ™ tctatct 

TACTCCCTAA TAAGCAGTCT TAAXCAXGCC AGTTCTTTTG GGTATTCCCT 

^ TTTCCTCCCT TTCC " CTCC TAACmS " °»™** CTTACTTTTC 
TTAAAAAGGG C7TCGGTAAC ATAGCTATTG CXATTXCArX CTTTCTTGCT CTTATTATTC 
CGGTTAAGTG AATTCTTGTG GGTTATCTCT CTGAXAXXAC CCCXCAATXA GGCXGXG^ 
TTGTTCAGGG TCXXGAGXXA ATTCTCGGGT GTMTCGGGT XGGCXGXTXX TATCTTATTC 
TCTCTGTAAA GGC7GCTATT TTCATTTTTr irrrr* > ' ""ATTC 

rrCATTTrrG ACGTTAAACA AAAAATCGTT TCTTATTTGC 
ATXGGGAXAA AXMXAXGCG XGXXXAXXXT GTAACTCGCA AATTAGGCTC TGGAAAGACG 
CTCGTTAGCG XTCGXAAGAX TGAGGATAAA ATTGTACCTC GGTGCAAAAT AGCAACTAAT 

CXXGAXXXAA CGCTTGAAAA cgtgccggaa gtggggaggt tggctaaaac ccctcgcgtt 

CTTACAATAC CCGATAACGG TTCTATATCT CAXTXGGTXG CTATTGGGCC CGGTAATGAT 
XCCXACGAXG aaaataaaaa ccgcttcctt GTTCTCCATC AGTGGGGTAG ttggtttaat. 
ACCGGTTCTT GGAATGATM GGAAAGAGAG CCGAXXAXTG AXTGCXTXCT ACATGGTCGT 
AAATTACGAT GGGAXAXXAX mXGTTGTT CAGGACTTAT CTATTGTTGA TAAACACCCC 
CGTICTGCAX TAGCTGAACA XGXTCXXXAX TGTCCTCCTC TGGACACAAT TACTTTACCT 
TTTGTCGGTA CTXXAXAXXC TCITATTACT GGCTGGAAAA TGGCICTGGC TAAAXXACAX 
GTTCGCGTTC TTAAATATGG CGATTCTCAA TTMGGCCIA CTGTIGAGCG TTGGCTTTAT 
ACTGCTAAGA ATTTGTATAA CGCAXATGAT ACTAAACAGG CTTTTTCTAG TAATTATCAT 
XCCCGTGXTT ATTCTTATTT AACCCCTTAT TTATCACACG CTCGGTAITr CAAACCATTA 
"XXXACGTC AGAACATGAA GCTTACTAAA ATATATTTCA AAAAGTTTTC ACGCGTTCTT 

GTCTTGCGA TTCGATTTCC ATCAGCATTT ACATAXAGXX ATATAACCCA ACGXAAGGGG 
GAGGTTAAAA AGGXAGXGXG TCAGACCTAT CATTTTGATA AATTCACTAT TGACTCTTCT 

AICIAACCTA TCCCIAICTT ""^ CTAAGCGAAA ATTAATTAAT 
«CGAGGAXX TACAGAAGCA AGCTTATTCA CTCACAIAIA XXGAXXXAXC TACXGXXTCC 
™*AAC GTAATTCAAA TGAAAITCTI AAATGTAATT AATTTTCTIT TCTTGATCTT 
™xga xgxxgxtxtc GXCAOCXAAX XCAAAXCAAX AAXTCCCCXG XGCCCCAXXT 
X^AAGTXOC TAXXGAAAGC AAXCAGGCGA AXCGCTXAXT OXTTGXGCCG AXGXAAAAGC 
TACXCXXACX GXAXAXXCAT CXGACCXXAA ACCTGAAAAX CXACGCAAXT XCTTXAXXXC 

IZZZ GCtAATAATr TIGATATGCT XTCAGAACIA 

" AIATOAICA AIKCCATCA IC ~ T " TC — * 
T^AAXXCC GCXCGXXCXG GXGGXTXCXT XGXXCCGGAA AAIGAXAATC XXACXCAAAC 

TTTXAAAAXT AAXAAGCXTG GGCCAAAGCA XTXAAXACGA GXTGXGGAAX XGXXTGXAAA 

GTGXAATACX XCXAAAXGGX GAAAXCXATX AXCXAXTGAG GGCTCTAAXC XAXTACXXGT 

XAGXGCACCX AAAGAXAXTX XAGAXAACCX TCGXGAAXXC CXTTCXACXG XXGAXTXGGG 



2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 
AACTCACCAC ATATTGATTG AGGGTTTGAT ATTTGAGGTT CAGCAAGGTG ATGCTTTAGA

rrrrrcArrT gctgctggct gtgagggtcg cactgttgga ggcggtgtta atactgagcg 

CCTCACCTCT GTTTTATGTT CTGCTGGTGG TTGGTTGGGT ATTTTTAATG GCGATGTTTT 

AGGGCTATCA gttcgcgcat taaagactaa tagccattca aaaatattgt ctgtgccacg 

TATTCTTACG C7TTCAGGTC AGAAGGGTTC TATCTCTGTT " GGCCAGAATG TCCCTTTTAT 
TACTCGTCGT GTGACTGGTG AATCTGCCAA tgtaaataat ccatttcaga cgattgaggg 
TCAAAATGTA GGTATTTCCA TGAGCCTTTT TCCTGTTGCA ATGGCTGGCG GTAATATTGT 

TCTGGATATT accagcaagg ccgatagttt GAGTTCTTCT actcacgcaa gtgatgttat 

TACTAATCAA AGAAGTATTG CTACAACGG7 TAATTTGCGT GATGGACAGA CTCTTTTACT 

CGGTGGCCTC agtgattata. aaaacacttc tcaagattct ggcgtaccgt tcctgtctaa 

AATCCCTTTA ATCGGCCTCC TGTTTAGCTC CCCCTCTCAT TCCAACGAGG AAAGCACGTT 
ATACGTGCTC GTCAAAGCAA CCATAGTACG CGCCCTGTAG CGGCGCATTA AGCGCGGCGG 
GTGTGGTGGT TACGCGCAGC GTGACCGCTA CACTTGCCAG CGCCCTAGCG CCCGCTGCTT 
TCGCTTTCTT CCCTTCCTTT CTCGCCACGT TCGCCGGCTT TCCCCGTCAA GCTCTAAATC 
GGGGGCTCCC TTTAGGGTTC CGATTTAGTG GTTTACGGCA CCTCGACCCC AAAAAACTTG 
AmGGGTGA TGGTTCACGT AGTGGGCCAT CGCCCTGATA GACGGTTTTT CGCCCTTTGA 
CGTTGGAGTC CACGTTCTTT AATAGTGGAC TCTTGTTCCA AACTGGAACA ACACTCAACC 
GTATCTCGGG CTATTCTTTT GATTTATAAG GGATTTTGCC GATTTCGGAA CCACCATCAA 
ACAGGATTTT CGCCTGCTGG GGCAAACCAG CGTGGACCGC TTGCTGCAAC TCTCTCAGGG 
CCAGGCGGTG AAGGGCAATC AGCTGTTGCC CGTCTCGCTC GTCAAAAGAA AAACCACCCT 
GGCGCCCAAT ACGCAAACCG CCTCTCCCCG CGCGTTGGCC GATTCATTAA TGCAGCTGGC 
ACGACAGGTT TCCCGACTGG AAAGCGGGCA GTGAGCGCAA CGCAATTAAT GTGAGTTAGC 
TCACTCAm GGCACCCCAG GCTTTACACT TTATGCTTCC GGCTCGTATG TTGTGTGGAA 
TTGTGAGCGG ATAACAATTT CACACGCCAA CGAGACAGTC ATAATGAAAT ACCTATTGCC 
TACGCCAGCC GCTGGA1TGT TATTACTCCC TGCCCAACCA GCCATGCCCG AGCTCGTCAT 
GACCCAGACT CCAGAXATCC AACAGGAATG ACTGTTAATT GTAGAACGCG TCACTTGGCA 
CTGGCCCTCG TTTTACAACG TCGTGACTGG GAAAACCCTC GCCTTACCCA ACCTTAATCG 
CCTTGCAGAA TTCCCTTTCG CCACCTGGCC TAATAGCCAA GAGGCCCGCA CCGATCGCCC 
TTCCCAACAC TTGCGCAGCC TGAATGGCGA ATGGCGCTTT GCCTCGTTTC CGGCACCAGA 
AGCCGTGCCC GAAAGCTGGC TGGAGTGCGA TCTTCCTGAG GCCGATACGG TCCTCGTCCC 
CTCAAACTGG CAGATGCACG GTTACGATGC GCCCATCTAC ACCAACGTAA CCTATCCCAT 
TACCGTCAAT CCGCCGTTTG TTCCCACCGA GAATCCGACC GGTTGTTACT CGCTCACATT 
TAATGTTGAT GAAAGCTGGC TACAGGAAGG CCAGACGCGA ATTATTTTTG ATGGCGTTCC 
TATTGGTTAA AAAATCAGCT GATTTAACAA AAATTTAACC CGAATTTTAA CAAAATATTA 



4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 



ACCTTTACAA TTTAMTArT TGCTTATACA ATCTTCCTCT TmGGGGGT TTTCTGA1TA 

TCAACCCGGG T^CATATCAT -nr.r, rCTGATTA 

CTTTCC-CCA r GTTTrACCAT TACCGT7CAT CGA ™^ 

I™ : r AGG CAATGACCTG ATAGCCTTTG tagatctctc a — 

ATCAGCTAGA ACGGTrGAAT - atcatattga tgg ™ 

^TCTCTCCG GCCTTTCTCA CCCTTTTGAA TCTTTACCTA CACATTACTC AGGCATTGCA 
TTTAAAATAT ATGAGGGTTC TAAAAATTTT TATCCT.CCC TTGAAATAAA GGCTTCTCCC 
= ^ TAATGTTTTT GGTACAACCG ATTTAGCTTT ATGCTCTGAG 
CTTTATTGG TTAATTTTGC TAATTCTTTG CCTTGCCTGT ATGATTTATT GGATGTT 
(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: circular



6900 
6960 
7020 
7080 
7140 
7200 
7260 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

AATGCTACTA CTATTAGTAG AATTGATGCC ACCTTTTCAG CTCGCGCCCC AAATGAAAAT 
ATAGCTAAAC AGGTTATTrA rnrrr^ ^ AAT 
GGTTCGGAC, T C "™GGA AATCTATCTA ATGGTCAAAC TAAAICTACT 

Z^Z MCW,Btt TCGAATGAAA CCCTACTTTA 

ZZZ r CATGT TGACCTACAC A0C — CTCTAAGCCA 

n^Z 2 TCAMACGAC ™« -gtgaggtg 

™T WHB GOTCCCT " ^CCAA TTAAAACGCG ATAITTCAAG 

CACGGTAAAC ^ ™~ " 

«AAC AGCTGATTTT TCATTTATCC XCArrCTGGX TCTCTCAACT GTTTAAAGCA 

m^„ AITCMICAA tATTTATCAC GATTCCGCAG TATTGGACCC TATCCAGTCT 
"AGmiTA CTAITACCCC CTCTGCCAAA ACTTCTTTTC CAAAAGCCTC TCCCTATTTT 
«™ GTCGTCTGGT AAACGAGGGT TATGATAGTG TTGCTCTTAC TATGCCTCGT 
Z^Z C0C0TIAICT AICTG " m -»» CTATTCCTAA AXCXGAAGXC 

" ZgT" CCCTIACTTC ™* — ™ 

CTAIMTCAC CCMITCTTA AAATCCCATA AGCTAATTCA 

caa™ acttgaaatt aaaggatgig aaggggaati tagtacxcgt tgtgctcttt 

CTCCTCAGCG CAACCCTTAT TCACTCAATC ACCAGCTTTC TTACCTTGAT TTCGGTAATG 

ZZ2 AmCTCTO AT0AACCTCA °— ™ 
"™ ~ c TcmcAAAG CG ™ ™- 

CCGATTTCGA CACAATTTAT 

CAGGGGATGA TACAAATCTC CGTTGTACTT TCTTTCCCCC TTGCTATAAT GGGTGGGGGT 



60 
120
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 



CAAAGATCAG TGTTTTAGTG TATTCTTTrr rn-r 

-CAXXAC CTATTTTACC C ^ ST" 

cmacccxgx cxaggcgxxg cxagccxcgx x™ TC CmA ° TCCT 

CGAXCCCGCA AAAGCGGCCX XXAACXCCCX C^C ^ 
TGCGXGGGGG AXCGXXGXXG XCAXXGXCCG ^ ' 

~» ^ccx ::zzz ™ tgtt7aacaa 

TTTTTGGAGA TTTTCAACGT r A* » a GGCTCCTTTT GGAGCCTTTT 

-cxcacx z:z x^r " cctttact ™ c 

-ACXAACG TCXGGAAAGA CC^ ^ ^ « 

====== 

= - r= = = 

ATTCCGGGCT ATACTTATAT CAACCCTCTC GACGGCACTT ATCCGCCTGG TACTGAGCAA 
AACCCCGCTA ATCCTAATCC TTCTCTTGAC r.rr^ TACTGAGCAA 

LTTGAG GAGTCTCAGC CTCTTAATAC rrrr.TPT^ 
CAGAATAATA GGTTCCGAAA TAGCCAGGGG ar.-r-r TTTCATGTTT 
CAAGGCACTG ACCCCGTTAA ^ ^^ATTAACTC TTTATACGGG CACTCTTACT 

TATGACGCTT Zcr " — ^G 

TATGACGCTT ACTGGAACGG TAAATTCAGA GACTGCGCTT TCCATTCTCr rrrr 

= = rr — - = = 
==r = === 

GATTTTCATT ATGAAAAGAT CGCAAACGCT AATAACCCrr rr 1T , 

GAAAACCrrr tap.o, AATAAGGGGG CTATGACCGA AAATGCCGAT 

tiAAAACGCGC TACAGTCTGA CGCTAAAnrr a*»,~~._ 

CCICCTAlre irrrrrrr AAACXXCAXX CXCXCGCXAC XCAXXACGGX 

«.TGCTAXtC ATGGXTXCAI IGGXGACGXX XCCCCrr-rrr ™. 

GCTGAIXXXC GXGGCXGXAA XXGCCAA.^ ™^™T 
HAAXCAAIA .rrZT GCXCAAGXCC CXGAGCCXGA XAAXXCACCX 

™ ™: ~ t — — «~ 

"cccxcoxc xgxxxg^ ^ 

xi iii- TCTTTTATAT GTTCCCACCT TTATGTATGT ATTTTCTACC 

TTTGCTAACA TACTGCGTAA TAAGGAGTCT TAATCATGCC Am J! ATTTTCTACC 
TATTATTrro ~- iAATCATGCC AGTTCTTTTC GCTATTCCCT 

TATTATTGCG TTTCCTCCCT TTCCTTCTGG TAACTTTCTT r«v-, 
TTAAAAAGGC rrrrvv-r. TAACTTTCTT CGGCTATCTC CTTACTTTTC 

1TAAAAAGGG CTTCCCTAAG ATAGCTATTC CTATTTP a tt ™. 

GGCTTAACTC AiTTfrr CTATTTCATT GTTTCTTGCT CTTATTATTG 

TTGTTCAGQS ~« « 

XCXCXGXAAA GGCXCCIAXr £££ ~« 

axxgggaxaa axaaxaxgg" x™ ^ r r TOTAITTC0 

WAACTCGCA AATTACGCTC TGGAAAGACC 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 



TCACGATAAA AmTACCT0 GGTCCAAAAT AGCAACTAAT 
Z C- CTCGGGAGGT TCCCTAAAAC GCCTCGCGTT 

OXAGAAXAC C.CAXAAGGC „„„„„ GATTTGCTTG CXAXXGGGCG CGGTAATCAT 
™GGAXG AAAATAAAAA CGCCTTGCTT GXXCXGGAXG AGXGGGGXAG ™l 
ACCCGTTCTT GGAAXGAXAA GGAAAGAGAG CGGAXXAXXG AXXGGXTXGX ACAXCCTCGX 

GGXXCXGCAX T^" T ™* T ~ 

™ "T" TmGTTTAT TCTCCTCCTC 

" Z TCTTAmCT ° CCTCCAAAA TC ™ C ™— 

GTTGGCGTTG TTAAAXAXGG GGATXGXGAA XXAAGCCCXA CXGXXGAGCG XXGGGXXXAX 
ACTGGTAAGA ATTTGTATAA crrir^n^ ^w,CTTTAT 
XCCGGXGXXT AxZ ACTAAACAGG GXTXXXGXAG XAAXXAXCAX 

TGGGGXGXTX AIXGXXAXXT AAGGGGXTAX XXAXGAGAGG GXGGGXAXTX CAAACCAXXA 

™: ;t cm gcttactaaa ai ™ ~ ™ 
= aggxagxcxc ^ ^ ^ " 

CAGCGXCXTA ICAGACCTAI "XXXXGAXA AATXGAGXAX XGACXGXXGX 

CACGGXGXXA AXGXAAGGXA XGGGXAXGXX XTGAAGGAXX GXAAGGGAAA AXXAAXXAAX 
AGGGAGGAXX XAGAGAAGGA AGCTTArTCA CXCACAXAXA XXGAXXXAXG XACTGTTICC 
^AAAAAAG GXAAXXCAAA XGAAAXXGXX AAAXCTAAXT AAXXXXGXXX XGXXGA^X 
TGXXXGAXGA XGXTGXTXXG CXCAGGXAAX XGAAAXGAAX AAXXCGCCXC XGCCCCAXXX 

= := ~ — — — 

■ Trm XAXAXXCAX CXGACGTTAA ACCXGAAAAX GTACCCAAIX XCTTXAITTC 

r^ r™" CCXXCCAXAA XXGAGAAGX 

rZTcc Z A ~ A AITCCCATCA ICTC — ™™ 

l CTGGTTTCIT TOTCCCCAA ~= TTACXCAAAC 

X~ C ""^ mAAMCCA XGXXXGXAAA 
CICTAAXACX XCXAAAXGCX GAAATCTATT AICTATTCAC GGCICTAAXC XAXXAGXXCX 

= ^ XAGATAACCT XGGXCAAXXC GXXXCXAGXG " 
Z^X :™ AGGCTIIGAI AmGAOTI — = AXGCXTXAGA 

g^Z 2, CICAGCCT0C CA ™™ u axc ^ 

ZT" tIaaR - et A ™= ««™ 

™ f" 1 — " AAAATATTCT CXGXGCCACG 

a™ Z AGM0CCTTC TAICTCTOT <~° -™ 

~ AATCTCCCAA TCTAAA ™ T ™" -XXCAGGC 

™ A r mccA TcAcccmT tc ™ 

TGXGGAXAXX ACCAGCAAGG GCGAXAGXXX GAGXXGXXGX ACXCAGCCAA GXGAXGXXAX 



3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 



TACTAATCAA AGAACTATTC GTAGAAGGCT TAATTTGCCT CATGGACAGA CTCTTTTACT 
CCCTCCCCC ACTCATTATA AAAACACTTC TCAAGATTCT CGCCTACCGT TCCTCTCTAA 

ATACCTCCTC ^ ^ ^ 

= :;:;r c ™ — 

rccnncn cccttccttt gtggII CCCCCTCOT 

GGGGGGTGGG TO GGG^ G^ ^ 

T7TAGCCTTC GGATTTACTC CTITACGGCA CCTCGACGCG AAAAAACTTG 
ATTTCCGTGA IGGTTCACGT ACTCCCrr.T AAAAAACTTC 
«T AGTGGGCCAT CCCCCTGAIA CACGCTTTTT CCCCCTTrri 

= = r— - = 

™ A0CTCTTGCC — - ~ AAACCACCCT 

GGCGCCCAAT ACGCAAACCG GGTGTGGGGG GGGGTTGGGG GATTCATTAA TGCACCTGCC 

;™ i::r o ° ™ — - ™ 

~ ATAACAATTT ^ ^ "~ 

~a jsz z zz zr* goccctcgti mc " cCTc 

AAGCACTATT GCACTGCCAC TCTTACCCTT 

CCACGTPrrr TC "*««I ACTGTTTACG CCTGTCGCAA AAGGGGAGGT 

I™ ~ ;™ c ~ ~ ~ 

TCAGGCCCCC TCACCAGCCC GG^ 

TACTCCCTCA cZT " TCCTACACTC 

l ^ " CACCCTGCCC GACCTACATC 

TGTACT CAGCAACACC ^CGGAAATGt 
TGTACTAGTC GATCCTACCC CTArn.r ^<-*AAiCT 

GACCCTCCTA ACGCTCCATT 

GCTTCGGCTA TCCTAGTAGT T^I 

TTTACGAGCA A«JT^ ^ ^AAATT ATTCAAAAAG 

~ ™™ accCAICCC CCT — 

CGCAAAGCTC CCTGGAGTGC GATCTTCCTC 
GGCAGATCCA CCGTTACGAT 

AICGGGGGTT x™- GCGCCCATCT ACACCAAGGT MCCTAICCC ATTACCGTCA 

CAGAAICCCA Wa ° nk ™™» C 
ATGAAAGCTC GCTACAGGAA GGCCAGACCr riim 

AAAAAATCAC CTCATTTAAC AAAAATTTA ^ 
AATTTAAATA Tr^Z ^ «*«n» AACAAAAIAT TAACGTTTAC 

AATTTAAATA TTXGGTTATA CAATCTTCCT GTmTGGGC CTTTTCTCAT TAXGAAGGGG 



5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 



7380 
7440 
7500 
7560 
7620 
7680 



GGTACATATC ATTGACATGC TAGTTTTACC 
CAGACTCTCA GGGAATGAGC TGATAGCCTT C« 

«™, ™ A 0 ; ^ ~ c ; ca — ™ 

CGGCCTTTCT C'CCCTTrrr ATATCATATT GATGCTGATT TCACTGTCTC 

~g cZT. ™ ~ AC -~° 

ATTACAGCGT CA^ ^ ^ * 

U ^GGTACAAC CGATTTAGCT TTATGCTCTr a,v~» 
GCTTAATTT7 GCTAATTCTT rrrr^rn 1TATGCTCTG ACGCTTTATT 

CTAATTCTT TGCCTTGCCT CTATCATTTA TTGGACGTT 
7729

(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7557 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: both

(D) TOPOLOGY: circular 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:
AATGCTACTA CTATTAGTAG AATTCATrrr *,~™~ 

4TirrT AATTGATGCC ACCTTTTCAG CTCGCGCCCC AAATGAAAAT 

ATAGCTAAAC AGGTTATTGA rrii-rrr^ iljAAAAT 
TTATTGA CCATTTGCCA AATCTATCTA ATGGTCAAAC TAAATCTACT 

TCCCCAAAAA TGACCTCTTA TCAaT^ 

™ ™ A ~ ~ rr tcctcacctc 

TCTTTCGGGC rrrrrr™ GAAGCTCGAA TTAAAACCCC AIATTTGAAG 

™ Z TCmTTCAT CCMTCCCCT ™» 

~ A™ TCATTCTCCT m ™ 

— * CTATTACCCC ^ ^ ^ 
rrT , ,.,. PA ^ ACTTCTTTTG CAAAAGCCTC TCGCTATTTT 

= ™ — — — - 

ATGAATCTTT CTACCT^ ^ ~« 
XCTXCCCAAC GTCCTGACTG ^ ^ «~ ~ 
CAATGATTAA AGTTGAAATT AM^Z 

CTCGTCAGGC ri4 , AAGCCCAATT TACTACTCGT TCTGGTGTTT 

~ ~: ir^ agcagctttc ™~ ~« 

TGTACACCGT « ^ ~ GCGCCTGGTC 
GTCTGCGCCT C^C^T «< 
CAGGCCATGA TA^ <*™ 

-CATCAC TG^l A ^ ~™ «~ 

TTTTAGTG TATTCTTTCG CCTCTTTCGT TTTAGGTTGG TGCCTTCGTA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



c™ ™T ™"* — = — ctttagtcct 

ZZZ CTACCCTCCT tcccaicctc ™™ 

CGATCCCCCA «*««, ^ CCAAGCCTCA 

TGCGTGGCCG ATCGTTGTTG TCATTCTCCT ^ ATATCCCTTA 
AITCACCTCG TCArr ° TCCC ^GAACTATC JjGTATCAAGC TCTTTAAGAA 

Z « ^ GATAAACCGA TACAATTAAA GGCTCCTTTT GGAGCCTTTT 

rmTGGAGA TTTTCAACCT GAAAAAATTA TTATTCGCAA TTCCTTTAGT TGTTCCTTTC 
TATTCTCACT CCGGXGAAAG TGTTGAAACT TGTTTAGCAA AACCCCATAC AGAAAATTCA 
TTTACTAACG TCTGGAAAGA CGACAAAACT TTArATr^, AGAAAATTCA 
QTfyrm iTC "CAAAACT TTAGATCCTT ACGCTAACTA TCAGGGTTCT 

CTCTGGAATC CTAGAGGGGT TGTAGTTTGT AGTGGTGAGC AAAGTCAGTG TTACGGTACA 
TCGCTTCCTA TTCCGCTTGC TATCCCTGA. TTACGGTACA 
,.„„ TATCCCTGAA AATGAGGGTG CTCGCTCTCA GCGTGCCGGT 

TCTGAGGGTG GCCGTTCTCA CCrrrrrm 'W.UWT 
ATTCCGGCGT 4T .^ ACTAAACCTC CTCAGTACGG TCATACACCT 

AT Z CAACOTCTC CACCGCACTT AICCCCCTCG T — 
AACCCCCCTA ATCCTMTCG TTCTCTTGAG GAGTCTCAGC CTCTTAATAG TTTCATGTTT 

CAGAATAATA GGTTCCGAAA TAGGCAGGCC GCATTAACTC TTTATACGGG CACTGTTACT 

~ MCTrAITAC — ~ 

IT " GCMCGC TMAITCAGA GACTGCCCTT TCCATTCTCG CTTTAATGAA 

"CCATTCG TTTGTGAATA TCAAGGCCAA TCGTCTGACC TGCCTCAACC TCCTC^ 

CGCGCTTCTG "~» " <~ 

GGCGCTTCTG AGCGTCGCGC CTCTGAGGGA GGCCGTTCCC GTGGTGGGTC TGGTTCCGCT 

GATTTTCATT ATCAAAAGAT CGCAAACGCT AATAAGGGCG CTATCACCCA J£Z 
GAAAACGCGC TACAGTCTGA CCCTxa^ "aa^CCGAT 
GCTGCTATCG ATCGTTTCAT ^ AAACTTCATT CTCTCGCTAC TCATTACGCT 
CCTCATTTT* ™- TGCTGACGTT TCCCGCCTTC CTAATGGTAA TCCTGCTACT 

ZT" GTGACGGTCA TAATTCACCT 

mA^AATA ATTTCCGTCA ATATTTACCT TCCCTGCCTC AATCGCTTCA ATGTCGCCCT 
" ~ ACCATATGAA TTTTCTATTG ATTGTCACAA AATAAACTTA 
~0 TOncCCn TOTTTATAT CTTCCCACCT TTATGTATCT AXmCTACC 
™* TACTCCGTAA TAAGCAGTCT TAATCATCCC ACTTCTTTTC GGTATTCCGT 
ZZ r CTCOT «— GTTACTTTTC 

a™ ^ZT CCTCmOT ™- "™ 

^ ~ TTACCCTCTC ACTTTCTTCA 

AAAGCCTCCT <*TTCCCTCT TTTTATCTTA TTCTCTCTCT 

ZttT ™ ACCT ™ AC * eiTOB « TGGATTCCGA 

™ ^r AI CTCT — ACGCTCGTTA 

GCCTTCGTAA GATTCAGGAT AAAATTCTAC CTCGCTCCAA AATAGCAACT AATCTTGATT 



1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 






" CCTCCCC ccttkctaa <— 
Z C ™ TA TGTGAmcc TT ™ gcccot - «™ 
XX=T" " CGGGTTG CTTGTTCTCC AICAGTCCCC TAC ™ 

~ CAGCGGAITA CGTAAATTAG 

CAXCGGAXAX ;ATTTTTCTT GXXCAGGAGX TATCTATTGT TCATAAACAG GGGCGXXGXG 
CAXTAGGXGA ACATGTTGTT XAXXGXGGXG GXGXGGAGAG AAXXACXXXA CCXXXXGXCG 
OXACXXXAXA TXCXCXXAXX AGXGGGXCGA AAAXGGGXGX CCCXAAAXXA CAXGTXGGGG 
XXGXXAAAXA XCGCGAXXCX GAATXAAGGC CXACXGTXGA GGGXTGGGXX XAXAGXGGTA 
AGAAXXXGXA XAACCCAXAX GAXAGXAAAG AGGGXXXXXG XAGXAAXXAX GAXXCCGGXG 
TTXAXXGXXA XXXAAGCGGX XAXXTAXGAC ACGGXGGGXA XXXCAAACGA XXAAAXXXAG 
GXCAGAACAX GAAGGXXAGX AAAAXAXATX XGAAAAAGTT XXGAGGCGXX CXXXGXCXXG 
CGAXXGGAXX XGCAXCAGCA XXXACAXATA GXTAXAXAAG GCAAGCXAAG CCGGAGGXXA 
AAAAGGXAGX CXCXGAGACC XAXGAXXXXG AXAAAXTCAC XAXXGAGXGX XGXCAGGGXC 
TXAATCXAAG CXAXGGGXAX GTTTTCAAGG AXXGXAAGGG AAAAXXAAXX AATAGCGACG 
ATTXAGAGAA GCAAGGXXAX XGAGTCAGAX AXAXTGAXTX AXGXAGXGXX XCGAXXAAAA 
AAGGXAAXXC AAAXGAAAXX GXTAAAXGXA AXXAAXXTXG XTXXGXXGAX GXXXGXTXGA 
XGAXCXXCXX XXGGXCAGGX AAXTGAAAXG AAXAAXTCGC GXGXGGGCGA XXXXGXAAGX 
XGCXAXXCAA AGCAAXCAGG CCAATCCCXX AXXGXXXGXC CCCAXCXAAA AGGXACXGXX 
AGXGXAXAXX CAXCXGACGX XAAACCXGAA AAXCXACGGA AXXXCXXXAX XXCXGXXTXA 
CGXGGXAAXA AXXXXGAXAX GGXXGGXXCA AXXGCXXCGA XAAXXCAGAA GXAXAAXCCA 
AACAAXCAGG AXXAXAXXGA XGAAXTCCCA XGAXCXGAXA AXCAGGAAXA XGAXGAXAAX 
XGCGGXCGXX GXGGXGGXIX GXXXGXTGCG GAAAAXGAXA AXCXTACXCA AACXXXXAAA 
AXXAAXAAfc XXCGGGGAAA GGAXXXAAXA CCACXXCXCG AAXXGXXXGX AAAGXCXAAI 
ACIXGXAAAX CGXCAAAXGX AXTATCXAXT GACGCCTCTA ATCXATTAGX XGXIAGXGCA 
CCXAAAGAXA TXTXAGAXAA CCTTCCTCAA XTGGITICIA CTGXTCAXTT GGCAAGXGAG 
CAGAXAXTCA XXGAGGCm CAXAXXXGAG GXTCACCAAG GTGATGCTXT AGAXXTXTCA 
TTXGCXCCIC GGTGXCACGG TCGCACTGTI GCAGGCGGXC XXAAXAGXGA CCGCCXCAGC 
XCTGXIXXAT GXXGXGGXGG XCGTTCGTTC CGXATTTTTA AICGCGAIGX XTTAGGGCTA 
TCAGXTGGCG CATXAAAGAC XAATAGCCAI XCAAAAAXAX XGXCXGXCGG ACGTATTCTT 
AGGGXTXCAG GXCAGAAGCG XXCXATGTCX GIXGGCGAGA AXCXGGCXTX XATTAGXCCT 
GCXGXGAGXC GXGAAXCXCC CAAXGTAAAX AATCCAXTTC ACACGATTGA GCGXCAAAAX 
GXACGXATTT GCAXGAGGGX XXXTCCTGXX GGAAXCCCIC GCGGXAAIAX XGTXGXCGAX 
AXXAGCAGGA AGGCCCAXAG XXXGAGTXCX XGXAGTCAGG GAAGXGAXGX XATTACXAAX 
CAAAGAAGXA XXGGXACAAG GGTXAAXXXG GGXGAXGGAG AGAGXGXXXX ACXGGCXGGG 



3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 



CTCACTGATT ATAAAAACAC TTCTGAapat -™ 

TTAATCGCCC --rr™. 1 CTGGCGTAC CGTTCCTCTC TAAAATCCCT 

TTAATCGCCC .CTCTTTAO GTCCCCCTCT CATTCCAAGG ACCAAACCAC CTTATACGTC 
C^TCAAAC CAACCATAGT AGGGGGGCTG TACCCGCGCA TTAAGCCCCG C«G ^ 
GGTTACCCCC AGCGTCACCG CTACACTTCC CACCCCCCTA GCGCCCGCTC CTrrCGCTTT 

cttcc cttcc rrrcrccccA ggttcgcggg ctttcgccgt gaaggtctaa Z^Z 

CCCTTTAGGG TXCCGATTTA GTGCTTTACG GGAGCTCCAC CCCAAAAAAC 2^ 

r TGooc e ~" atagacgctt «~» -~ 

^CTATTCT TTTGATTTAT AAGGGATTTT GCCCATTTCG GAACCACCAT CAAACAGGAT 
mcGCCTGG TGCGGCAAAC CAGCGTGCAC CGCTTGCTGC AACTCTCTCA 
GTCAAGGGCA ATCAGCTGTT GCCCCTCTGC rrr„ CCGCCACCCG 
A„™ CCCCGTCTCG CTGGTGAAAA GAAAAACCAC CCTGGCGCCC 

AATACGCAAA CCGCGTCTCC CCGCCCCTTG GCCGATTCAT TAATCCACCT CCC^C 
GTTTCCCCAC TGGAAAGCGG GCAGTGAGGG CAACGCAATT AATGTCAGTT AGCTCACTCA 

ZHZ ttT 1 " actttatcct icgggctgct — 

CCGGTGGAT TGTTATTAGT GGGTGGGCAA CGACGCATGG CGGAGGTGTT CCCGCGATGT 

< ~ r ~ c ™ 

™g! aT CAAGCTCCAI CTCCCAGGAG 

™« Z r° acctacagcc t — c — <= 

™ tIZ ACACMACTC TACGGCTGGG — g 

CCCG TCACAAAGAG CTTCAACArr r/-./..™™_ 
CTCGGGGTrr " CAACAC0 GGAGAGTGTT CTAGAACGCG TCACTTGGCA 

ZZZ Z TmaAmc gaaaagggtg ggct — 

TtT^Z " CCAGCTGCCG TAATACCGAA GAGGGCGGCA CCCAXCCCCC 

TTCCCAACAG TTGCGCAGCC TGAATGGCGA ATCGCCCTTT GCCTCGTTTC CGGCACCAGA 
AGCGGTGCCG GAAAGCTGCC TCCACKOU r^,^. CGGCACCAGA 
CTQUA^ T0G * OTGC0 * TCTTCCIGAC GCCGATACCG TCCICCTCCC 

CTCAAACTCC CAGATCCACC CTTACGATCC GCCCATCTAC ACCAACCTAA CCTATCCCAT 
= CCGCCCTTTC TTCCCACGCA GAATCCCACC GCTTCTTACT 

~ C ATCGGGTrCC 

^ rr GCT CAmMCAA """"" CAAAATATTA 

ACGTTTACAA TTTAAATATT TCCTTATACA ATrrrrrr^ 

AAAUA ATCTTCCTGT TTTTGGCGCT TTTrTrAi-r* 
TCAACCCCCG TACATATGAT TGACATGCTA rrrm TTTCTGATTA 

iGACATGCTA GTTTTACGAT TACCGTTCAT CGATTCTCTT 
GTTTCCTCCA GACTCTCAGG CAATCACCTC >nr^ CGATTCTCTT 

ACCCTCTCCG GCATTAATTT ATCAGCTA^ 

ACTGTCTCCG GCCTTTCTCA ^ ™" 

CTTTCTCA CCCTTITCAA TCTTTACCTA CACATTACTC AGGCATTGCA
TTTAAAATAT ATCACCCTTC TAAAAATTTT TATCCTTCCC TTCAAATAAA GCCTTCTCCC
GCAAAACTAT TACACCCTCA TAATGTTTTT CCTACAACCC ATTTACCTTT ATCCTCTCAC 
CCTTTATTCC TTAATTTTCC TAATTCTTTC CCTTCCCTCT ATCATTTATT GCATGTT 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS:

It! Sf 1 *-" 8 } l f ba " ^« 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both

(D) TOPOLOGY: circular

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
AATCCTACTA CTATTACTAG AATTCATCCC ACCTTTTCAG CTCGCGCCCC AAATCAAAAT 
ATAGCTAAAC AGGTTATTGA CCATTTCCGA AATGTATCTA ATCGTCAAAC TAAATCTACT 
CCTTCGCAGA ATTGGGAATC AACTGTTACA TGGAATGAAA CTTCCACACA CCGTACTTTA 
GTTGCATATT TAAAACATCT TGAGCTACAG CACCACATTC AGCAATTAAC CTCTAAGCCA 
TCTGCAAAAA TGACCTCTTA TCAAAAGGAG CAATTAAAGC TACTCTCTAA TCCTGACCTC 
TTGGAGTTTG CTTCCGGTCT GGTTCGCTTT GAAGCTCGAA TTAAAACGCG ATATTTGAAG 
TCTTTCGGGC TTCCTCTTAA TCTTTTTGAT GCAATCCCCT TTCCTTCTGA CTATAATAGT 
CAGGGTAAAG ACCTGATTTT TGATTTATGC TCATTCTCCT TTTCTGAACT CTTTAAAGCA 
TTTGAGGGGG ATTCAATGAA TATTTATGAC GATTCCCCAG TATTGGACCC TATCCAGTCT 
AAACATTTTA CTATTACCCC CTCTGGCAAA ACTTCTTTTG CAAAAGCCTC TCGCTATTTT 
GGTTTTTATC GTCGTCTGGT AAACGAGGGT TATGATAGTG TTGCTCTTAC TATGCCTCCT 
AATTCCTTTT CCCGTTATGT ATCTGCATTA GTTGAATGTG GTATTCCTAA ATCTCAACTG 
7™ GTACCTCTAA TAATGTTCTT CCCTTAGTTC GTTTTATTAA CGTAGATTTT 
TCTTCCCAAC GTCCTCACTC GTATAATGAG CCAGTTCTTA AAATCGCATA AGCTAATTCA 
CAATCATTAA AGTTGAAATT AAACCATCTC AAGCCCAATT TACTACTCGT TCTGCTGTTT 
CTCGTCACOG CAAGCCTTAT TCACTGAATC AGCAGCTTTC TTACGTTCAT TTGGCTAATG 
AATATCCGGT TCTTGTCAAG ATTACTCTTG ATGAAGGTCA CCCAGCCTAT GCGCCTGGTC 
TGTACACCCT TCATCTCTCC TCTTTCAAAG TTGCTCAGTT CGCTTCCCTT ATGATTGACC 
GTCTGCCCCT CCTTCCGGCT AAGTAACATC GACCAGCTCG CGGATTTCGA CACAATTTAT 
CAGCCCATGA TACAAATCTC CCTTCTACTT TCTTTCCCGC 

CAAAGATGAG TGTTTTAGTG TATTCTTTrv rrt 

" CITTCG CCTCTTTCGT TTTAGGTTGG TGCCTTCGTA 

CT ™ C C<OT — ' CCTC ATCAAAAACT CTTTAGTCCT 

CAAAGCCTCT GTACCCGTTG CTACCCTCGT TCCGATGCTG TCTTTCGCTG CTGAGGCTCA 
CGATCCCGCA AAAGCGGCCT TTAACTCCCT GCAAGCCTCA GCGACCGAAT ATATCGCTTA 
TGCGTGGGCC ATCCTTGTTC TCATTCTCCC CCCAACTATC GGTATCAAGC TCTTTAAGAA
ATTGACGTGG AAAGCAAGCT CATAAACCCA TACAATTAAA CGCTCCTITr GGAGGGTTTT
TTTTTGGAGA TTTTCAACCT GAAAAAATTA TTATTCCCAA TTGGTTTAGT TCTTCCTT7C 
TATTCTCACT CCGGTGAAAG TGTTCAAAGT TGTTTAGCAA AAGGGGATAG AGAAAATTGA 
TTTAGTAAGG TCTGGAAAGA CGACAAAACT TTAGATGGTT ACGGTAAGTA TGAGGGTTGT 
CTGTGGAATG GTAGAGGGGT XGTAGTTTGT AGTGGTGAGG AAAGTGAGTG TTACGGTAGA 
TGCGTTCCTA TTCGGCTTGC IATGGGTGAA AAIGAGGGTG GTGGGTGTGA GGGTGGGGGT 
TGTGAGGGTG GGGGTTGTGA GGGTGGGGGT ACTAAACCTC GTGAGTAGGG TGATAGAGGT 
ATTGGGGGGT ATACTTATAT GAAGGGTGTG GAGGGGAGTT ATCCGCCTCG TAGTGAGGAA 
AAGGGGGGTA ATCCTAATCC TTGTGTTGAG GAGTGTGAGG GTGTTAATAG TTTCATGTTT 
GAGAATAATA GGTTGGGAAA TAGGGAGGGG GGATTAAGTG TTTATACGGG CACTGTTACT 
GAAGGGAGTG ACCCCGTTAA AAGTTATTAG GAGTAGAGTG GTGTATGATG AAAAGGGATG 
TATCACGCTT AGTGGAAGGG TAAATTCAGA GACTGCGCTT TCCATTCTGG CTTTAATGAA 
CATCCATTCC TTTGTGAATA TCAAGGCCAA TGGTGTGAGG TCCCTCAACC TCCTCTCAAT 
GCTGGCGGCG GGTGTGGTGG TGCTTCTCGT GGGGGGTGTG AGCGTGGTGG GTGTGAGGGT 
GGGGGTTGTG AGGGTGGGGG CTCTCAGGGA GGCGGTTCCG CTGGTGGCTC TGGTTCCGGT 
GATTTTGATT ATGAAAACAT GGCAAACGCT AATAAGGGGG GTATGAGGGA AAATGGGGAT 
GAAAACGCGC TAGAGTGTGA CGCTAAAGGC AAACTTGATT GTGTGGCTAG TGATTACCGT 
GCTGCTATCG ATGCTTTCAT TGGTCACGTT TGCGGGGTTG CTAATGGTAA TCGTGCTACT 
GGTGATTTTC CTGGCTCTAA TTCCCAAATC GCTCAAGTCC GTGAGGGTGA TAATTCACCT 
TTAATGAATA ATTTGGGTGA ATATTTACCT TCCCTCCCTC AATGGGTTGA ATGTGGGCGT 
TTTGTGTTTA GGGGTGGTAA ACCATATCAA TTTTCTATTG ATTGTCACAA AATAAACTTA 
TTCCGTGGTG TGTTTGGCTT TCTTTTATAT CTTGCCACCT TTATGTATCT ATTTTCTACG 
TTIGGTAAGA- TACTGCGTAA TAACCAGTCT TAATGATGGG ACTTCTTTTG GCTATTCCGT 
TATTATTGCG TTTCCTCCGT TTCCTTCTGG TAACTTTGTT CCCCTATCTG CTTACTTTTC 
mAAAACGG CTTCGGTAAG ATAGCTATTC CTATTTCATT GTTTCTTCCT CTTATTATTG 
GGCTTAACTC AATTCTTGTG GGTTATCTCT GTGATATTAG CGCTCAATTA CCCTCTGACT 
TOTTGAGa TGTTCACTTA ATTCTCCCCT CTAATGCGCT TCCCTGTTTT TATGTTATTC 
TCTCTCTAAA GGCTGGTATT TTCATTTTTG ACCTTAAACA AAAAATCCTT TCTTATTTGG 
ATTCCGATAA ATAATATGGG TGTTTATTTT GTAAGTGGGA AATTAGGCTC TCGAAAGACG 
CTCCTTAGCG TTCGTAAGAT TCAGGATAAA AITGTAGGTG GGTGCAAAAT AGCAAGTAAT 
CTTCATTTAA CCCTTCAAAA CCTCCCGCAA GTCGGGAGGT TGGCTAAAAC GCCTCGCCTT 
CTTAGAATAC CCGATAAGCC TTGTATATGT CATTTCCTTC CTATTGCCCC CGGTAATCAT 
TCCTACGATG AAAATAAAAA GGGGTTGGTT CTTCTCCATC AGTCGGOTAG TTGGTTTAAT 
ACCCCTTCTT GGAATGATAA CCAAACACAG CCCATTATTG ATTGGTTTCT ACATGCTCGT
™ mi ""^ acGAC ™ ™*

™ Z ;" TCAACA TGTTCTTrAI TCTOTCCTC ic ~ 

™ ' TCTIATTACT GOTCCAAAA T0CCTCTG « TAAATTACAT 

™ ;r cgaitctcaa mAocOT a ™-« ™«m M 

~ Z? IAA GGGAIATGAI ACTAAACACG ^ ™« 

icokxgxxx attcttattt aacgggxxax txaxgagacg cxcggxaxxx GAAAGCAXXA 

MTXTAGGXG ACAACATCAA GCXXAGXAAA ATATATTTCA AAAAGXXXXC AGGCGXX^ 

nBB4mGC ATCA0CAm A " TATACTT ACCTAAGCCG 
™ AGCTAGICTC TGAGACCTAT ° A ™ A "~ TGACTCTTCT 
a" 22 7^ XCCCXAXCXX XXCAAGGAXX CTAAGGGAAA ATTAATTAAT 
AGCGACGATT TACAGAAGCA AGGTTATTCA CTCACATATA TTGATTTAXG XAGTGTTTCG 
ATUAAAAAC GTAATTCAAA TCAAATTGTT AAATGTAATT AATTTTGTTT TCTTCATGTT 
™ A XCXXCXXXXG CXCAGGXAAX XGAAAXGAAX AAXXCGGGXC XGGGGGAXXX 

x™ TATTCAAAGC AICCOTOT «™» — 

XACXGXXAGX GXAXAXXGAX CXGACGXXAA AGGXGAAAAX CXAGGCAAXX XGXXXAXXXG 

™ :? iaait ~ ct TCmc "" — — 

" " T "™ *™ XCXGAXAAXC AGGAAXAXCA 

TCAXAAXXCG GCXCCXXGXG GXGGXXXCXX XGXXCGGGAA MXGAXAAXG XTAGXCAAAC 

T COTC GGGGAAAGGA CTTCTCG - XGXXXCXAAA 

xlTcZ CAAATCTATI ATCTATO " CGCXCXAAXC XAXXAGXXGX 
TAGTCCACCT AAAGATATTT TAGATAACCT irrrrA.^ . 

AArTr „.„ CCT TCCTCAArrC CTTTCTACTG TTCATTTCCC 

AXAXXGAXXG AGGGXXXGAX AXXXGAGGXX CAGCAAGCXG AXGGXXXAGA 

rxmcAXxY GCXCCXGGGX CXGAGCGXGG CACXGXXGGA GGCGGXGXXA AXACXGACGG 
CXXXXAXCXX CTGCXGGXGG XXGGXXCGGX AXXXXXAAXG GGCAXGXXXX 

^TCA CXXCGGGGAX XAAACACXAA XAGCCAXXCA AAAAIAIXCX CXCXGGCACC 

™ — « — X, GGCCACAAIC XCGGXXTXAX 
TACTGCTCGT GTGACTGGTC AATrrr ri-t* 

XCAAAATTTA /v-r AAXCXCCCAA IGXAAAIAAX CCAXXTCAGA CGAXXCAGCG 

TCAAAATGXA GGIAIIIGGA IGAGCCTTTT tj . i 

. TGAGCCXIIX XCCIGXIGCA AIGGCTGCCG CTAAIAXXGX 

XCIGGAIAXI ACCAGCAAGC CCCAXACTTT rx, i u , .. . 

TArr.AT,... "-GGAIAGTXX CAGIXCIICI ACICACGGAA CIGAIGXIAI 

CCGXCGCCTC ™ ^ CXCXXXXAGX 

ACXGAXXAXA AAAACACXXC XCAAGAXXCX CGGGXAGGCX XCGXGXGXAA 

™ C TGmA ° CTC CC0CICTOAT TCC — —CGXX 

T CTCAAAGCAA GGATACTAC0 C0CCCICTA ° ACCGCCGCGC 
OTGXCGXGGX XACGCGGAGC GXGACGGGXA GAGXXGGCAG GGCCGXAGCG CGCGCXCGXX
ZZZ l mccm CTCCCCACCT icgcccoot — ~

z : ;™ cGAmACTc enMBa ™< ~ 

ATTOGG.GA TuCTTCACCT AGTGGGCCAT CGCGGTGATA GACGGTTITr CGGGCTTTGA 
CCTOGACTC CACGTTCTTT AATAGTCCA,- Tr-rr,— GGCCCTTTGA 
CTATCTCGGG c-irrrJZ TCTTGT7CCA AAGTGGAAGA ACACTCAACC 

TATCTCGGG CiATTCTTTT GATTTATMG GGATTTTGCC GATTTCCGAA CCACCATCAA 
GAGGATTTT CGCCTGCTGG GGGAAACGAG GCTGGAGGGG TTGCTCCAAC TCTCTCAGGC 
CCAGGCCG7C AAGGCCAATC AGCTCTTGCC CGTCTCGCTG OTGAAAAGAA AAACCACCCT 
GGCGCCCAAT ACGGAAAGGG CCTCTCCCCG CGCGTTGGCC GA^AA 
ACCACAGGTT TCCCGACTGC AAAGCGGGCA CTCAGCGCAA CGCAATTAAT CTGAGTTACC 
TCACTGAtTA CGCACCCCAG CCnTAGAGT TTATGCTTCC GGCTCGTATG TTGTCTCGAA 
TTGTGAGCGG ATAACAATTT r*r*^~ i4 »rGTGGAA 
TAGGGGAGGG G^ ATAAIGAMT AGGTATTGGG 

TAGGGGAGGG CCTCGATTCT TATTACTCGC TGCCCAACCA GCGATGGGGG AGCTGTTGGG 
GCCATCTCAT GAGGAGTTGA AATCTCGAAC TGCCTCTGTT GTCTCCCTCC TGAATM^T 
CTATCCCAGA GAGGGGAAAG TACACTGCAA GGTGGATAAC GCCCTCCAAT CGGGTAACTC 
GCAGGAGAGT CTCACACAGC ACGAGAGCAA GGAGAGGAGC XAGAGGCTGA CCAGCACCCT 
GACGCTGAGC AAAGGAGAGT ACGAGAAAGA CAAAGTCtAG GOGXGGCAAG TCACCCATCA 
CGGGGXGAGG TCGCCCCTCA CAAAGAGCTT CAACAGGGGA GAGTGTTCTA GAACGCCTCA 

™ TACMCCTCC ~™ — " 

TTTCTACATG GAGAAAATAA ACTGAAACAA ACGAGTATTG CACTGGCACT CTTACCGTTA 

~ Z A0CCGOTCC — CCCCCTGGCA 

TCCCCCAACC Z TCGGGGCACA GCCGCCCTGC GCTGCCTGGT CAAGACTAAT 

IZZT TCGTC0AACT mcT = 

TCGGGGGIGT CCTACAGTCC TCAGGACTCT AGTGGGTGAG GAGCGTGGTC ACGGTGGGGT 
= CCGGAGCCAG ACCXACA.CX GCAACGTGAA XCACAACCCC l ZZ 

CCCAAAtOT ™" TACCACCTTC 

! W ~ ACCCTCCTAA GCCTGCATTC AATACTTTAC 

ACCCAACTCC TACTGACIAC AITGCCTACC GTTCGGCTAT GCTACTACTT ATAGTTGCTC 
CTACCATAGC CATTAAAXXA TTCAAAAAGT TTACCAGCAA CCCTTCTTAA GCAATACCCA 
AGACCCCCGC ACCGATCGCC crrrrr*^* GCAATACCGA 
TGCCTCTTTT ' ^CCAACA GTTCCGCAGC CTGAATGGCG AATGGCGCTT 

TGCCTGGTTT CCGGCACCAG AAGCGGTGCC GCAAAGCTGG CTGGAGTGGG ATCTTCCTGA 
GGCCGATACG GTCGTCGTCC CCTTaaa^ ATCTTCCTGA 

CC CCTCAAACTG GCAGATGCAC GGTTACGATG CCCCCATCTA 
CACCAACGTA ACCTATCCCA TTACCCTPa a ^™ "^CATCTA 
rrrTTr ™ TTACGGTCAA TCCGCCCTTT GTTCCCACGG ACAATCCCAC 

GGGTTGTTAC TCCCTCACAT tta 1t ^. 

AATTArrrrr r TTAATGTTCA TGAAAGCTGG CTACAGGAAG GCCAGACGCG 

AATTATTTTT GATGGCGTTC mrr^. 

CTATTCGTrA AAAAATGAGC TCATTTAACA AAAATTTAAC
GCGAATTTTA ACAAAATATT AACCTTTACA ATTTAAATAI TTCCTTATAC AATCTTCCTG

mnccccc ttttctcatt atoaagggcg ctacatatga ttgacatggt agttttacga 

TTACCGTTCA TCGATTGTCT TGTTTGCTCC AGACTCTCAG GCAATCACCT GATAGGGTTI 
GTAGAICTGT caaaaatagg iaccctgicc GGCATTAATT tatgaggtag AACGCTTCAA 
TATCATATTG ATGGTGATTT GACTCTCTCC CGCCTTrCTC ACCCTTTTCA ATCTTTACCT 
ACACATTACT GAGGGATTGG ATTTAAAATA TATGAGGGTT CTAAAAATCT TTATGGTTGG 
CTTCAAATAA AGGCTTCTGG CGCAAAAGTA TTACAGGCTC ATAAIGTOT TGGTACAAGG 
GATTTAGCTT TATCCTCTGA GGCTTTAUG CTTAATTTTG CTAATTCTTT GCCTTGCCTC 
TATGATTTAT TGCACGTT 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(B) LOCATION: replace(5, "")

(D) OTHER INFORMATION: /note= "S REPRESENTS EQUAL MIXTURE

(ix) FEATURE:

(A) NAME/KEY: misc_difference
(B) LOCATION: replace(6, "")

(D) OTHER INFORMATION: /note= "M REPRESENTS EQUAL MIXTURE

(ix) FEATURE:

(A) NAME/KEY: misc_difference

(B) LOCATION: replace(8, "")

(D) OTHER INFORMATION: /note= "R REPRESENTS EQUAL MIXTURE

(ix) FEATURE:

(A) NAME/KEY: misc_difference

(B) LOCATION: replace(11, "")

(D) OTHER INFORMATION: /note= "K REPRESENTS EQUAL MIXTURE

(ix) FEATURE:

(A) NAME/KEY: misc_difference

(B) LOCATION: replace(20, "")

(D) OTHER INFORMATION: /note= "W REPRESENTS EQUAL MIXTURE

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
AGGTSMARCT KCTCGAGTCW GG
(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
AGGTCCAGCT GCTCCAGTCT GG 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
AGGTCCAGCT CCTCGAGTCA GG 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
AGGTCCAGCT TCTCGAGTCT GG 
(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
AGGTCCAGCT TCTCGAGTCA GG 
(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
AGGTCCAACT GCTCCAGTCT GG 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AGGTCCAACT GCTCGAGTCA GG 
(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
AGGTCCAACT TCTCGAGTCT GG 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
AGGTCCAACT TCTCCAGTCA GG
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(B) LOCATION: replace(5..6, "")
(D) OTHER INFORMATION: /note= "N=INOSINE"
(ix) FEATURE:

(A) NAME/KEY: misc_difference
(B) LOCATION: replace(8, "")

(D) OTHER INFORMATION: /note= "N=INOSINE"
(ix) FEATURE:

(A) NAME/KEY: misc_difference
(B) LOCATION: replace(11, "")

(D) OTHER INFORMATION: /note= "N=INOSINE"
(ix) FEATURE:

(A) NAME/KEY: misc_difference
(B) LOCATION: replace(20, "")

(D) OTHER INFORMATION: /note= "W REPRESENTS EQUAL MIXTURE

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
AGGTNNANCT NCTCGAGTCW GG 22

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CTATTAACTA GTAACGGTAA CAGTGGTGCC TTGCCCCA 38 
(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AGGCTTACTA GTACAATCCC TGGGCACAAT 30 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CCAGTTCCGA GCTCGTTGTG ACTCAGGAAT CT 32 
(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
CCAGTTCCGA GCTCGTGTTG ACCCAGCCGC CC 
(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
CCAGTTCCGA GCTCGTGCTC ACCCAGTCTC CA 
(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
CCAGTTCCGA GCTCCACATC ACCCAGTCTC CA 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CCACATGTCAOGCTCGTCATC ACCCAGACTC CA 
(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
CCAGATCTGA GCTCCTCATC ACCCAGTCTC CA 
(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:
CCAGTTCCCA CCTCCTCATC ACACAGTCTC CA 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CCAGCATTCT AGAGTTTCAG CTCCAGCTTG CC 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GCGCCGTCTA GAATTAACAC TCATTCCTGT TGAA 
(2) INFORMATION FOR SEQ ID NO:27: 
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:
GATCCTAGGC TGAAGGCGAT GACCCTGCTA AGGCTGC 
(2) INFORMATION FOR SEQ ID NO: 28:

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
ATTCAATAGT TTACAGGCAA GTGCTACTCA GTACA
(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:
TTCGCTACGC TTGGGCTATG GTAGTAGTTA TAGTT 
(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GGTGCTACCA TAGCGATTAA ATTATTCAAA AAGTT 
(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:
TACGAGCAAG GCTTCTTA
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
AGCTTAAGAA GCCTTGCTCC TAAACTTTTT GAATAATTT 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 base pairs

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:
AATCCCTATC GTAGCACCAA CTATAACTAC TACCAT 
(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
ACCCCAAGCC TAGCCAATCT ACTCAGTAGC ACTTG
(2) INFORMATION FOR SEQ ID NO: 35: 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CCTGTAAACT ATTGAATGCA GCCTTAGCAG GGTC 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
ATCCCCTTSA GCCTAG 
(2) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
CATTTTTGCA GATGGCTTAG A 
(2) INFORMATION FOR SEQ ID NO: 38: 
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:
TAGCATTAAC GTCCAATA 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
ATATATTTTA GTAAGCTTCA TCTTCT 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:
GACAAAGAAC CCCTCAAAAC TTT
(2) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
GCGGGCCTCT TCGCTATTGC TTAAGAAGCC TTGCT 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 43 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
AAACGACCGC CAGTGCCAAG TCACGCGTCT CAAATTGTTA TCC
(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
GGCGAAAGGG AATTCTGCAA GGCGATTAAG CTTGGGTAAC GCC 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
GGCGTTACCC AAGCTTTCTA CATGGAGAAA ATAAAG 
(2) INFORMATION FOR SEQ ID NO:45: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 42 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
TGAAACAAAG CACTATTGCA CTGCCACTCT TACCGTTACC GT 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 42 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
TACTGTTTAC CCCTGTGACA AAAGCCCCCC AGGTCCACCT GC 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 44 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47:
TCGAGTCAGG CCTATTGTCC CCAGCCATTC TACTAGTGGA TCCG 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
TGGCGAAAGG GAATTCCCAT CCACTAGTAC AATCCCTG 
(2) INFORMATION FOR SEQ ID NO: 49: 
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
GGCACAATAG GCCTGACTCG AGCAGCTGGA CCAGGGCGGC TT 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 42 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
TTCTCACACG CCTAAACAGT AACCGTAACC GTAAGTGTGC CA 
(2) INFORMATION FOR SEQ ID NO: 51: 
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
CTCCAATACT GCTTTGTTTC ACTTTATTTT CTCCATCTAC AA 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:
TAACGGTAAG AGTGCCAGTG C 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53:
CACCTTCATG AATTCGGCAA GGAGACAGTC AT 

(2) INFORMATION FOR SEQ ID NO: 54:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54:
AATTCGCCAA GGAGACAGTC AT 

(2) INFORMATION FOR SEQ ID NO: 55:
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55:
AATGAAATAC CTATTGCCTA CGCCAGCCGC TGGATTGTT 

(2) INFORMATION FOR SEQ ID NO: 56:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 
ATTACTCGCT GCCCAACCAG CCATGGCCCA GCTCCTCAT 39
(2) INFORMATION FOR SEQ ID NO: 57:
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57:
GACCCAGACT CCAGATATCC AACAGCAATG AGTGTTAAT 
(2) INFORMATION FOR SEQ ID NO: 58: 
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:
TCTAGAACGC GTC 

(2) INFORMATION FOR SEQ ID NO: 59:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:
TTCACGTTGA AGCTTACGCG TTCTAGAATT AACACTCATT CCTGT 
(2) INFORMATION FOR SEQ ID NO: 60: 
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60:
TGGATATCTC GAGTCTGGGT CATCACGAGC TCGGCCATG 
(2) INFORMATION FOR SEQ ID NO: 61: 
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61:
GCTCCTTCCG CACCCAGTAA TAACAATCCA GCCCCTCCC 
(2) INFORMATION FOR SEQ ID NO: 62: 
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:
GTAGGCAATA GGTATTTCAT TATGACTGTC CTTGGCG 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
TGACTGTCTC CTTGGCGTGT GAAATTGTTA 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64:
TAACACTCAT TCCGGATGGA ATTCTGGAGT CTGGGT
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 
GCCAGTCCCA AGTCACGCCT TCTA 
(2) INFORMATION FOR SEQ ID NO:66: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66:
ATATATTTTA GTAAGCTTCA TCTTCT

(2) INFORMATION FOR SEQ ID NO: 67:
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 
CACAAAGAAC GCGTGAAAAC TTT 

(2) INFORMATION FOR SEQ ID NO: 68:
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:

CTCAACCTCT CTCCCACCAC AGTTGATGCT ATAGGATCAG ATCTAGAATT CATTTAGAGA 60
CTGGCCTGGC TTCTGC 76

(2) INFORMATION FOR SEQ ID NO: 69:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 80 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:

TCGACCGTTG GTAGGAATAA TGCAATTAAT GGAGTAGCTC TAAATTCAGA ATTCATCTAC 60
ACCCAGTCCA TCCAGTAGCT 80

(2) INFORMATION FOR SEQ ID NO: 70:
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70: 
GGTAAACAGT AACGGTAAGA GTGCCAC
(2) INFORMATION FOR SEQ ID NO: 71:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 54 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
CGCCTTCAGC CTAAGAAGCG TAGTCCCCAA CGTCGTACGG GTAGGATCCA 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 41 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
CACCCCTTCC GGGAATTAGT CTTGACCAGG CAGCCCAGGG C 
(2) INFORMATION FOR SEQ ID NO: 73:
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
ATTCCACACA TTATACCAGC CGGAAGCATA AAGTGTCAAG CCTGGGGTGC C 
(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 42 base pairs

(B) TYPE: nucleic acid

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
CTGCTCATCA GATGGCGGGA AGAGCTCGGC CATGGCTGGT TG 
(2) INFORMATION FOR SEQ ID NO: 75: 
(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75:
GAACACACTG ACCCAGGCCG CGAGCTCCGC CATGCCTCGT 



